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ABSTRACT 

We present the clustering of galaxy clusters as a useful addition to the common set of cosmo- 
logical observables. The clustering of clusters probes the large-scale structure of the Universe, 
extending galaxy clustering analysis to the high-peak, high-bias regime. Clustering of galaxy 
clusters complements the traditional cluster number counts and observable-mass relation anal- 
yses, significantly improving their constraining power by breaking existing calibration degen- 
eracies. We use the maxBCG galaxy clusters catalogue to constrain cosmological parameters 
and cross-calibrate the mass-observable relation, using cluster abundances in richness bins 
and weak-lensing mass estimates. We then add the redshift-space power spectrum of the sam- 
ple, including an effective modelling of the weakly non-linear contribution and allowing for 
an arbitrary photometric redshift smoothing. The inclusion of the power spectrum data allows 
for an improved self-calibration of the scaling relation. We find that the inclusion of the power 
spectrum typically brings a ~ 50% improvement in the errors on the fluctuation amplitude erg 
and the matter density Q.,„. Finally, we apply this method to constrain models of the early 
universe through the amount of primordial non-Gaussianity of the local type, using both the 
variation in the halo mass function and in the cluster bias. We find a constraint on the amount 
of skewness /nl =12 + 157(lcr) from the cluster data alone. 
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1 INTRODUCTION 

Galaxy clusters are the most massive bound systems in the Uni- 
verse which trace the ev olution of the lar ge-scale structure (LSS) 
(see the recent review bv lAUen et al.l20lll) . The initial density per- 
turbations are though t to have formed in t he early universe from 
inflationary physics jUvth & Liddld I2OO9I) . In the simplest sce- 
nario, the perturbations can be modelled as Gaussian random fields 
teardeen e t al. 1986), which evolve gravitationally. This leads to 
the formation of bound dark matter structures - the haloes, whose 
abundance is described by the halo mass functio n. The simplest 
infall formation model jPress & Schechtetlll974l) is complicated 
by dynamical effects, meaning that acc urate modelling of the 
mass function requires partial calibra tion (Sh eth & To rmen 1 9^ ; 



iBorgani & Kravtsov|[2009l. for a review). Nonetheless galaxy clus- 
ters form at a comoving scale of ~ 10/?"'Mpc, allowing for a sim- 
pler theoretical description than is possible for smaller structures 
such as galaxies. Due to their scale, clusters reside in the tail of the 
halo mass function and thus their numbe rs are exponentially sensi- 
tive to variations in cosmology (see e.g.. lEvr^dlll989l : lFrenk et"al] 
1 1990: Bahcall etai]|l997h . 

Clusters are detected across multiple wavelengths with vary- 
ing degrees of success. A few dozens have been found in 
the millimeter by the Atacama Cosmology Telescope (ACT) 
jMenanteau et af 20121). hundreds w ith the South Pole Tele- 



iMaggiore & Riottdl20 1 d; ICorasaniti & Achitouvl l20 1 ih or full fit- 



ting JJenkins et al 



2001 



[Tinker et al.i i2008i) to A^-body simula- 
tions. Halo mergers and internal processes such as galaxy for- 
mation further complicate the picture at small scales (e.g., see 



scope (SPT ) (Reichardt et al.' '2012|) and the Planck satellite 
jPlanck Collabo ration et al. 201 1.): also a few hundreds of clusters 
ha ve been found in the X-ray (REFLEX, BCS, eB CS catalogues 
bv lBohringer et al.ll2004 lEbeling et al.lll998L |2000|) using the the 
ROSAT satellite All-Sky Sur yev dVoges et al .11 19991). the Chandra 
Cluster Cosmology Project jBurenin et al. 20071 : vikhlinin et al.l 
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l2009ah . and by iMehrtens et all (1201 2|) using X-ray Multi-Mirror 
Mission Newton i Fassbendei 2008h . Many tens of thousands have 
been found in the optical using S loan Digital Sky Surv ey (SDSS) 
data t o construct the maxBCG jKoester et al. I l2007bh and GM- 
BCG ( iHao et alj2010l) catalogues, based on the selection of bright- 
est cluste£_galaxies (BCG) to identify the clusters' centres, and 
bv lGladders & Yed ( l2005h using the Canada-France-Hawai'i Tele- 
scope and Cerro Tololo Inter- American Observatory. 

While detectable in large numbers, the main obstacle with us- 
ing optical clusters as probes of cosmology is the difficult choice 
of a low-scatter m ass proxy. Efforts on this front have been led by 
IRozo et alj hoXfh and lZu et al. who derived cosmological 

constraints from the maxBCG cluster sample. The tightest scaling 
relation between ob servable and clus ter mass comes from X-ray 
data (< 10% scatter. IXllen et al.ll2008h . Constraints on dark energy 
with ~ 20% uncertainty were obtained from X-ray cluster samples 
studied by Mantz et al. (2008, 2010) and Vikhlinin et al. (20093)- 
Data on the cluster masses obtained from weak-lensing analyses 
of background galaxies have also been combined with the num- 
ber count s to improve the constraining power of the cluster mass 
funct ion djohnston et"ai]|2007l : lsheldon et al1l2009l : lMahdavi et al.l 
l2007h . The statistics o f rare event s in the high-peak, high-ma ss limit 
has also been used bv lHotchkisl j201 ih . lHovle et alj BoH) to test 
cosmology. 

The uncertainty in the scaling relation is one of the biggest 
obstacles in usi ng galaxy clu s ters a s cosmological probes, a s 
pointed out by iHaiman et alj j200lh . iBattve & Welleil j2003h . 
iMaiumdar & Mohil ^IQQ^ suggested to use the clustering of clus- 
ters as a complementary probe. So far, only limited efforts have 
been dedicated to the measure ment of the clustering properties 
of galaxy clusters: iHtitsil ( |2010|) measured the power spectrum of 
maxBCG clusters resu lting in weak detection of baryon acous- 
tic oscillations (BAO), lEstrada et al. 1 20091) measured the corre- 
lation function for the same, and " iHong et alj ( 1201 2h measured 
the correlation function of extended versions of maxBCG. Finally 
iBalaguera-Antolmez et al. l|) measured the power spectrum of 
the REFLEX X-ray cluster catalogue. The goal of this paper is to 
fully include the clustering information in the cosmological anal- 
ysis of cluster data: we show that its inclusion significantly im- 
proves the cosmological constraints, and also reduces the degenera- 
cies between the scaling relation nuisance parameters. We present 
the improved cosmological results obtained in this way from the 
maxBCG data. 

As an interesting application, we present the constraining 
power of these data on the amount of primordial non-Gaussianity 
(PNG) of the initial density perturbations, which is expected to be 
produced in some models of the early universe. Briefly, while the 
simplest single-fie ld slow-roll inflation produces nearly-Ga ussian 
initial conditions dMaldacenj [20031 : lAcquaviva etai]|2003h . there 
exist alternatives, such as multi-field models, which ca n pro- 
duce large non-Gaussianities (see e.g. the recent review bv lChenI 
These would have multiple observable consequences, of 
which we here consider two: the halo mass function changes 
as a function of the non-zero skewness ( Matarrese et al.l 2000l; 
Lo Verde et al. l2008l : |Pillepichetal]|2010l : | Achitouv & Corasanitil 



2012a| |bl). and in the local and orthogonal cases the halo bias 
becomes strongly scale-dep endent due to the cou pling of long- 
and short-wavelength modes dDalal et al. 2008: Matarrese & Verde 



and snort-wavelengtn modes (IJJatat et al. /(JUS: Matarrese & Verde 
2008| : ISlosar et al.l2008l:lAfshordi & T ollev 20 Q8l:lDesiacgues et al 



2OO9I: iGian nantoni o & Porcianill2010l: iDesiacQUfir& SeliakI 120101 : 



tionary models dBartolo et aLll2004 ISuvama et all2010h . The lat- 
est constraints on the local PNG parameter /nl from the bispectrum 
of the cosmic microwave background (CMB) as m easured by the 
WMAP satellite are -3 < /nl < 77 at 95 % c.l. dHinshaw et"aLl 
l2012l : lBennett et al.ll2012h : comparable bounds have been obtained 
from the LSS usin g multiple galaxy catalogues dSlosar et al.ll2008l: 



Ross et al]|2013l : IGiannantonio et al 



Afshordi & Tollev 2008; Xia et al. 2010a bl I2OI ll : ISefusatti et all 
2OI2I : ' 



l2013h : future galaxy 



Schmidt & Kamion kowskil 20 id : Desiacgues et alj 201 ih . Mea- 



surements of PNG can potentially rule out entire classes of infla- 



su rveys such as Euclid are expected t o reac h an accuracy of A/nl ~ 
3 dCiannantonio etalJl2012l) . TOgurild2009l) suggested that measur- 
ing the varianc e of cluster counts c an yield significant constraints 
on PNG, while lSartoris et aLld2010h showed in principle how such 
constraints can be improved by using the cluster power spectrum. 
Here we constrain PNG with the combined cluster data. 

The paper is organised as follows. In Section |2] we describe 
measurements of the cluster abundances, weak lensing mass es- 
timates and power spectrum of the maxBCG catalogue. In Sec- 
tion [3] we introduce the theoretical framework including number 
counts and total mass determination from the mass function, mass- 
observable relation, bias and power spectrum definitions, and the 
effects of non-Gaussian initial conditions. Section |4] presents our 
Monte Carlo Markov Chain (MCMC) analysis, the cosmological 
constraints and the relevant degeneracies. We draw our conclusions 
in Section|5] We assume that the Universe is spatially flat on large 
scales throughout. 



2 DATA 

2.1 The maxBCG cluster catalog 

The maxBCG catalogue dKoester et alj l2007al ) is a sample of 
13,823 galaxy clusters compiled from SDSS photometric data. The 
catalogue is assembled by selecting the brightest cluster galaxy and 
applying a red- sequence method to identify cluster members in its 
neighbourhood. In this way clusters with richness (number of mem- 
ber galaxies) ranging from 10 to 190 are selected. The low mass 
limit of this sample is Mun, ~ 7 - IO'^A^'Mq, which evolves weakly 
with redshift. This relatively low mass limit results in a sample that 
is significantly larger than other current galaxy cluster catalogues. 
The clusters are chosen in an approximately volume-limited way 
from a 500 Mpc^ region, covering ~7500 deg^ of sky with a photo- 
metric redshift (photo-z) range of 0.1 < z < 0.3. The photo-z eiTors 
are small and of the order of Az = 0.01. An analysis of mock sam- 
ples shows that the maxBCG algorithm results in more than 90% 
purity and m ore than 85% comple teness, for clusters with masses 
M > IO"*Mo dKoester et ai]|2007 j) . 

We define the richness N^^\ as the number of red galaxies 
within the radius /?200 from the cluster centre. /?200 is the radius 
within which the average overdensity is 200 times the mean density 
of the Universe. The catalogue is divided into nine richness bins in 
the range 11 < A'gai < 120, which approximately corresponds to 
7- 10"r'Mo < M < 1.2- lO'^r'Mo (Rozo et al. 2010). We found 
that adding the five remaining high-mass clusters of the maxBCG 
with richness A'gai > 120 has a negligible impact on the cosmolog- 
ical analysis, so we do not include them. We also use an additional 
bin at 9 < A'gai ^ H (Rozo, private communication), although 
we checked the results are not affected by this. For the cosmologi- 
cal analysis we inclu de Poisson errors and s ample variance due to 
large-scale structure dHu & Kravtsovll2003l> . Furthermore, we as- 
sume 100% purit y and completeness, including a 5% uncertainty 
dRozo etal.ll2010h . which we add in quadrature. We found that the 
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Figure 1. Top panel: MaxBCG cluster counts data (black points) and the- 
oretical predictions according to the prescriptions of Section [3]for a choice 
of different cosmologies (without primordial non-Gaussianity). The red hne 
represents the best-fit model to our full data set (counts, total masses and 
power spectrum). Bottom panel: Mean masses of maxBCG clusters (black 
points) and theoretical predictions for different cosmologies, as above. 



Figure 2. The observed power spectrum of maxBGC clusters (black points) 
compared with the full theoretical modelling P"*"^ described in Section|3]for 
our best-fit model (red solid line), and for two other models, assuming no 
primordial non-Gaussianity. The dotted line aXk = 0.15 h Mpc"' represents 
our choice of ^max, which is the smallest scale we use in the analysis. 



photo-z errors have a negligible impact on the number counts anal- 
ysis presented here, so we neglect their effect on the number counts 
covariance matrix. 

We show in the top panel of Fig. [T] the counts data, together 
with the predicted counts for a selection of different cosmologies, 
modelled as described in Section[3] 



2.2 Galaxy cluster masses from weak lensing observations 

ISheldon et all ( |2009|) measured the weak lensing (WL) effect from 
clusters in the maxBCG catalogue. By stacking the clusters, mean 
cluster surface density profiles were created for different luminos- 
ity and richness bins. The stacking of clusters in a given rich- 
ness bin improves the signal-to-noise considerably c ompared to the 
measu rement of the profile of an individual cluster. Ijohnston et al.l 
( l2007h used these profiles and reconstructed mean 3D cluster den- 
sity and mass profiles, which allows one to estimate the mass 
(and concentration) of clusters in a given redshift bi n. For this re- 
const ruction a Navarro-Frenk-White (NFW) profile ([Navarro et al] 
1 19971) for the cluster density was assumed. Ijohnston et al.l ( |2007| ) 
were then able to construct a mean mass-richness relation, finding 
for the whole sample of groups and clusters 



M2oo(A'gai) 



;x 10'^^r'Mo(A'gai/20)'-^^ 



(1) 



where M200 is the mass contained within the radius i?200- Due 
to photometric redshift bias these masses are corrected upward s 
by a factor of 1. 18 as described in Mandel baum et al] ( l2008bh . 
IRozo et al j201(]|). For the cosmological analysis presented here 
we follow Rozo et al] ( 120101) and fit simultaneo usly for the mass- 
richness relation using the Ijohnston et al.l (|2007|) data and their er- 
rors. We use five richness bins for this, in the range 12 < A'gai < 



300, plus another extra bin at 9 < A'„ai < 12 (Rozo, private commu- 
nication). 

An independ ent weak lensing ana l ysis of the maxBCG sample 
was performed bv iMandelbaum et al.l (l2008al) . who found a mean 
mass d ifference of approximately 6% with respect to ljohnston et al.l 
l l2007t) . We follow iRozoetai] ( I2OI0I) and include this discrepancy 
by introducing an offset factor j8 with a suitable chosen prior as 
described in Section [33] 

The bottom panel of Fig. [T] shows the mean WL mass esti- 
mates data, together with the theoretical mean masses for a selec- 
tion of different cosmologies, modelled as described in SectionjS] 

2.3 MaxBCG power spectrum 

We consider the redshif t-space powe r spectrum of the maxBCG 
sample, as measured by iHiitsil boiol) . For the full details of the 
power spectrum measurement, a long w ith systematics tests, we re- 
fer thereadertojlJutsi (2006a.b'.' 2010l) . The direct Fourier method 
bv lFeldman etali (,1994,) (FKP) was used, with the difference that 
fast Fourier transforms (FFTs) were used instead of direct summa- 
tion. This method actually yields the pseudo-spectrum, i.e. the mea- 
surement products are convolved with the window function of the 
survey. We take this into account when modelling the theoretical 
spectra in our analysis. To implement the modified FKP method, 
the following steps were followed: 

(i) The survey selection function (footprint) was represented us- 
ing a random (unclustered) catalogue with 100 times more points 
than maxBCG sample; 

(ii) The overdensity field was calc ulated on a regular grid usin; 
the triangular shaped cloud method dHocknev & Eastwoodlll98 
mass assignment scheme — it was checked that the aliasing effects 
due to the finite grid size were negligible for the measurements. 
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and w ere nonetheless corrected with the iterative method by iJind 
( l2005h ; 

(iii) The gridded overdensity field was transformed to Fourier 
space using the FFT; 

(iv) The raw 3D power spectrum was estimated by taking the 
modulus squared of the FFT; 

(v) The shot noise contribution was subtracted. 

The uncertainties on the power spectrum measurements were 
estimated with three different methods: with the original FKP the- 
oretical prescription, which assumes Gaussian errors from cosmic 
variance plus a shot noise contribution; with a jackknife method, 
implemented by dividing the survey into a total of 75 regions; and 
with a Monte Carlo (MC) method, based on the fiducial ACDM 
cosmology, in which 1000 mock realisations of the maxBCG sur- 
vey were generated, including redshift-space distortions and photo- 
z errors. These three methods were shown to be comparable; in this 
work we use the MC covariance matrix. 

The power spectrum measurements are shown in Fig. |2] To 
take into account data in the quasi-linear regime only, we restrict 
ourselves to scales larger than (wavenumbers smaller than) k^.^,^ = 
0.15 /iMpc-'. 

2.4 The counts-clustering off-diagonal covariance 

The diagonal blocks of the data covariance matrix for counts, 
masses and the power spectrum have been described above. We es- 
timate the off-diagonal terms of the covariance matrix between the 
clustering and the binned number distributions AA' of the maxBCG 
clusters. For simplicity, instead of the power spectrum we use here, 
as a clustering estimator, the projected correlation function w{6), 
which is defined as 



w(e) = {6H(h)5,{n')), 



(2) 



where 6i,{h) is the halo (cluster) projected overdensity in a direction 
M, and the average is earned over all pairs at an angular distance 6. 

We use the jackknife technique as follows: we split the 
maxBCG f ootprint into 100 e qual-area jackknife regions using 
HEALPpfl lGorski et al.l2005l) and populate the full footprint with 
50 random points for each maxBC G cluster, to reduce sho t noise. 
We use the correlation estimator by iLandv&SzalavId 19931) to cal- 
culate the congelation function, and bin the number of clusters 
within six equal-width bins in log^ space. We iteratively remove 
and replace each jackknife region and calculate the number his- 
togram and correlation function at each iteration. The covariance 
matrix Cjk between measured statistics x = x {a) and y = y {p) can 
be estimated from A' jackknifes using (see e.g. lEfronll 19821) : 




0.30 



CjK(x„,yj) 



N- 1 

A' 



^(x'i,-X,)„(/^.->;,)/5, 



(3) 



Figure 3. The off-diagonal elements of the normalised covariance matrix 
of the correlation function w{0) and the histogram distribution of AA', as 
calculated using the jackknife technique. 



we show the off-diagonal terms of the normalised Cjk[AA', w(0)], 
and note that the average value and Icr error of the off-diagonal 
terms are -0.06 ± 0.10, which is consistent with zero. We observe 
that as the number of jackknifes increases, the mean and error of the 
average value of the off-diagonal terms approaches and fluctuates 
around zero. 

We compare the magnitude of the off-diagonal terms 
obtained from the maxBCG clusters with simulated clus- 
ters from the Millennium Simulation dSpringel et all 120051; 
iLemson & Virgo Consortiu"mll2006l) . Specifically, we join the light- 
cone table of Henriques20 12a . wm apl . BC03_AllSky_00 
dHenrigues et alj |2012| ; Iguo et al.l [201 ih with the halo-tree table 
MPAHaloTrees. .MHalo. We apply the same redshift and sur- 
vey footprint constraints to mimic the maxBCG sample and calcu- 
late the coiTelation function and histogram distribution of AA'. We 
find the data and simulations agree closely: e.g., for 100 jackknifes 
the mean and Icr error of the off-diagonal terms are 0.05 ± 0.12 
from the simulations. 

From these tests we conclude that ignoring the off-diagonal 
covariance matrix between clustering and number counts is a rea- 
sonable approximation. 



where iy-j) is the statistic with jackknife region ; (j) removed, 
and X (y) is the average value of all x_, (>'-,). We note that typically, 
but not necessarily, x and y are the same statistic. 

We compare the square root of the diagonal elements of the 
covariance matrix Cjk[w(6), h'(6)] with the error expected from 
Poisson counting statistics and fin d agreement with the theoretical 
expectations (as described by e.g.. lRoss et al.ll2009l) . and also find 
that the diagonal elements of Cjk(AA'', AA') are approximately Pois- 
sonian, independently of the number of jackknifes used. In Fig. [3] 



http://healpix.jpl.nasa.gov/ 



3 THEORETICAL MODELLING OF CLUSTER 
STATISTICS 

3.1 Tlie cluster mass function 



IPress & Schechtej ( [l974l) first calculated the expected number of 
dark matter haloes of a given mass and redshift. This was better de- 
scribed by the excursion set approa ch dBond et al.lll99l|) an d gen- 
eralised to non-spherical model by ISheth & Tormen ( 1999h . who 
calibrated their mass function with A'-body simulations. Even more 
accurate est imations are achieve d with a full fitting to A'-body sim- 
ulation (i.e. Ijenkins et al.ll200ll) . The current state-of-the-art halo 
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mass function has been estimated bv lTinker et alj l l2008ll201oh : tliis 
mass function is valid over wide redshift and mass ranges. 

The expected number density of virialised dark matter haloes 
as a function of mass and redshift can be expressed as 



dn{M, z) 
dlnM 



d In cr 



dM 



(4) 



where p,„ is the mean matter density of the Universe, v = 
6clcr{M, z), 6c = 1.686 is the threshold linear overdensity for spher- 
ical collapse in a matter dominated Universe, and a^{M,z) is the 
variance of the linear matter density field a t M = 4nR^pj „ J3. In this 
work we use the mass function given bv [Tinker et al. 1 teOld) for 
cluster mass at R200, with an overdensity of A = 200 in units of the 
mean mass density of the Universe: 



Mv) = 0.368 [1 + [pvY j v-'^e-^"''^ . 
where the parameters evolve in redshift as 



/3 = 0.589 (1 +zf 



-0.243(1 +zr 



-0.729(1 +zr 



7 = 0.864(1 +zr 



(5) 



(6) 



3.2 The mass-richness scaling relation 

In order to perform a cosmological analysis, we need to make some 
assumptions on the scaling relation between the true mass of a clus- 
ter and its observed richness N°^^. We first consider the probability 
of observing member galaxies at R200 for a given true mass M 
of the cluster. We can write this probability as the product of the 
probability of a cluster having true richness A'gjii given the mass M 
and the probability of observing A'"^,'' member galaxies given the 
true richness Ng.^\: 



(7) 



Here p(A^°.!'i Wgai) is assum ed to follow a lognormal distribution as 
suggested bv lLima & Hul ( l2005h 



4 



2ncr} ., 



where 



lnW*-lnW,„ 



(8) 



(9) 



and CTiniy.,., is the scatter in th e mass-ric hness relation dLima & Hul 
2005l:lBattye & \yellej2003h . Following ljohnston et alj | |2007h and 
RozoetalJ ilOld) . we assume the scaling relation to be a power 



law in mass, i.e. 



InM 



lnM2oo|20 + Q'«ln(A'gai/20) 



(10) 



with M200120 the mass of a cluster with 20 member galaxies within 
a radius of Rioo and the slope of the scaling relation. We fit this 
relation by fixing two pivot points in mass M[ = 1.3 • 10'"* Mq and 
M2 = 1.3- 10'^ Mq, while the corresponding richness values In A'l = 
In A^gailMi and In A'2 = In A'gai|M2 are kept as free parameters. 

The statistical scatter around this relation is assumed to be 
constant with redshift and mass for individual clust ers. To get 
an est imate of this quantity is not trivial; however IRozo et aU 
( l2009l) used weak lensing and X-ray observations together with 
the maxBCG richness to have three different mass proxies. By de- 
manding c onsistency betwee n the X-ray and weak lensing mea- 
surements, [Rozoeral] lloO^ found cri„M|«„,j = 0.45 ^[j |g, which is 



the scatter in mass given the richness. For our cosmological analy- 
sis described in Section|4]we need to place a prior on the converse 
scatter, <T]„iy ^i^. The two quantities can be readily related to each 
other by invoking the scaling relation of Eq. ilO\ . which results in 

0"lnM|JV„,,| = Q'jV • 0"liiWg,,||M- 

3.3 Modelling galaxy cluster counts and total masses 

In order to predict the number of observed galaxy clusters for an 
observed richness N°^^, we can use the probability distribution and 
scaling relation defined in the previous section. The cluster average 
number density within a richness bin [Nf^^ . , A'"^,'';^,] is given by 

dn 



J^ ' dlniVg"^^ J rflnyVga, 



d In 



dn 1 

^'"^g"! o [erfc(xi) - erfc(.Y,+i)] , (11) 

rflnA'gai 2 



where x,- = x{N°^2i)-' 
dn 



dn 



rflnM 



dn 



■ CfN , (12) 

lilnA'gai <ilnM dlnA^ga, dXnM 

and we have employed the scaling relation of Eq. jlOt . The total 
number of predicted galaxy clusters within a richness bin can be 
calculated as 



AA',- = ^a 



d^-v 



(13) 



where ACl is the survey sky coverage and d^V/dz/dCl the volume 
element. The cosmology dependence is driven by the mass function 
and by the comoving volume element. 

We can write similar expressions to Eqs. (II U and ( 113) for 
the total mass of clusters. The average total mass (nm), contained 
within the same richness bin can be obtained as in Eq. ([IB by 
weighting the integrand by the mass, estimated via the mass- 
observable relation. The total mass of clusters within a richness 
bin is then 



d^V 

dz (nm)i , 

dzdn 



(14) 



where /? is an additional nuisance parameter introduced to account 
fo r possible mismatc h with the WL masses, as discussed above and 
in lRozo etan i l2010l) . 



3.4 Clustering of clusters 

Galax y clusters can be studied as tracers of the LSS dMo et al.l 
Il99d) . corresponding to the highest-density regions of the dark 
matter overdensity field 6(x , z). If we assume linear theory and a lo- 
cal deterministic halo bias dFrv & Gaztanag il993h . then the dark- 
matter haloes overdensity is Si,(x, M, z) = fco + bi,(M, z) 6(x, z). The 
local bias assumption breaks down in the case of primordial non- 
Gaussianity (see Section lsTsl l. As the effect of baryons is negligible 
for the clustering properties of the clusters, in the following we use 
the naming 'cluster' and 'halo' interchangeably. 

Halo bias 

The halo bias can be derived from a theor y of the mass func- 
tion via the peak-background split formalism JCole & Kaisejl989l ; 
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IMo& Whit j 19961) . This method gives a prediction for the halo bias 
in Lagrangian space b''{M, z), whi ch can be evolved in to the observ- 
able Eulerian space as b=l+b'' ( IMo& Whitelll996h . considering 
linear perturbation only, spherical collapse and no large-scale ve- 
locity bias. We assume the bias is scale-independent (except for 
the modifications in the presence of PNG). At linear order, the La- 
grangian bias is 



fc'-(M,z) : 



f'(y) 
f(y)<r' 



(15) 



where the derivative of the mass function is taken with respect to 
V, and the mass and redshift dependences of v and cr are implicit. 
When using the Tinker mas s function and keep ing the leading order 



When using the Imker mas s tunction arid keep ii 
terms, the Eulerian bias is ( [Tinker et al .120 lOl) : 



*r(M,z) : 



2(f>/dc 



(16) 



Sc 1 + |j8v]2# 

where all parameters are defined as in Eq. (|6]. We show in Fig. |4] 
the Tinker halo bias briM) as a function of halo mass M at z = 0.2, 
wh ich is the mean redshift of the maxBCG clusters, compared with 
the lPress & Schechted i ll 974 (PS) case fops = 1 - 1 /S^ + SJcr^, and 
with the scale-independent part of the bias in the presence of PNG 
described below, for our combined best-fit model. 

We obtain the average cluster bias b ov er the m ass range of 
our data by weighting with the mass function jLima & Hu 2005): 



Kz) 



dn(M, z) 

d\nM -^--^bT{M,z), (17) 
d\nM 



where the normalisation factor AA' is the full integral of the mass 
function in the observed range. 



Power spectra 

We then define the observable clustering statistics in Fourier space 
(denoted by a tilde). As we consider linear scales only, the halo- 
halo power spectrum Pu, can be related to the linear matter power 
spectrum Pii„ as 



PM,(k,M,z) = b\M,z)PUk,z) 



■■ b^{M,. 



■)D\z)PUk,Q), (18) 



where D(z) is the linear growth function. We integrate the mass 
dependence by weighting the bias as described in Eq. ( I17t and we 
compute all quantities at the mean redshift of our cluster sample, 
z - 0.2. This is further justified by observing that the growth of 
biz) is compensated by a similar drop in D(z); we have checked 
that for our fiducial cosmology, in the observed range 0.1 < z < 0.3 
the variation of b{z) D(z) is at the percent level. 

Before fitting models to the data, the following four effects 
have t o be taken into account, following the description by iHiltsil 
( I2OIOI) : the photo-z errors, which are responsible for a damping of 
the spectrum on small scales; the convolution with the survey win- 
dow, which suppresses the power on large scales; the non-linearities 
which add power on small scales, and the redshift-space distortions. 
The total observed power spectrum P°'"' is modelled as 



P°^'(k) : 



I 



dlriKK^ Pfn^(K)K(K,k), 



(19) 



where K(K,k) is the kernel accou nting for the effect of the finite 
survey area, given in Eqs. (9-1 1) o f lHutsil ( l2O10l) . and Pnl contains 
the remaining corrections and the effect of non-linearities. In our 
analysis we only use data up to fc„,„ = 0. 15 /? Mpc"' and we follow 
lHutsil ( l2010l) . modelling the effect of residual weak non-linearities 
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bp3(M) 




b,(M) 


. . 1 


- - b,„(M,f,,=400) ■ 



M [h-'Mg] 

Figure 4. Mass dependence of the lineal' halo bias at J = 0.2 for three mass 
functions: Press-Schechter (cyan dotted), Tinker (black solid) and modified 
LoVerde mass function in the presence of PNG (magenta dot-dashed), with 
/nl = 400. Cosmology is fixed to our combined best-fit model. 



with a simple effective fitting function with one free parameter ^nl- 
All these contributions lead to: 



Pt^dk) = (fo^'f (1 + ^nlA:''') s{k) PUk) 



2 1 2 



(20) 



Here the bias is rescaled as b"'"' = h ■ B, where we include a nui- 
sance parameter B to represent the uncertainty on the bias derived 
from the mass function. We model the photo-z smoothing with a 
corrective factor 



m = 



2cr.k 



erf (cTj k) , 



(21) 



assuming that photo-z errors follow a Gaussian distribution with 
dispersion 6z and corresponding spatial smoothing scale cr- = 
Sz ■ c/Hq. The last term of Eq. i20i is the correction due to 
redshift space distortions (R SD), for which we assume j8;(z) - 
Q.°fHz)/b°^%z) jKaiseJI 19871) . We have checked that the RSD cor- 
rection changes at most at the percent level if we calculate it at 
the limits of our redshift range. We finally take into account the 
Alcock-Paczynski effect jAlcock & Paczvnskilll979l) : we rescale 
the full theoretical power spectrum with respect to the cosmology 
used to convert redshifts to distances in the measurements (denoted 
by the superscript 'fid' ) , assuming a single isotropic dilation applies 
( lEisenstein et alj2005l ; lHutsill2006cl) . i.e. 



Here cv 



P°'"(k) : 



[C\\ C^j , C|| 



(22) 



//"VH, c_L = Da/D^/, Da is the an- 



gular diameter distance, where all quantities are calculated at the 
mean redshift z. 

We show in Fig.[2]the full cluster power spectrum P°^%k). The 
different lines correspond to the theory curves for our combined 
best-fit cosmology (red solid) and for two other models (blue and 
green) chosen to be at the 2cr limit of the marginalised n„ - erg 
contour, compared with the data. 

3.5 Primordial non-Gaussianity 

We extend our model to constrain PNG from both bias and abun- 
da nces of the maxBCG clust e rs. Br iefly, following e.g. the notation 
of iGiannantonio & Porcianil ( |2010|) . we introduce the /nl param- 
eter to quantify the amount of PNG in the simplest local, scale- 
independent case as 



0(x, z.) = <f>(x, z.) + /nl [vHx, z.) - (<p'Kz,)] 



(23) 
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where <I) is the Bardeen's potential at a primordial redshift z, and (fi 
is an auxiliary Gaussian potential. Throughout this paper, we define 
/nl by writing the previous equation at early times (i.e. z, ~ 1 100). 
The potential power spectrum can be approximated by its Gaus- 
sian part, Po(fc) - P,p(k), at leading order in /nl and neglecting 
trispectrum corrections. The matter perturbations are related to the 
primordial potential by the Poisson equation: 



6(k,z) = a(k,z)^(k,z,), 



with 



a(k,z) = 



2c'-k^T(k)D(z)g(0) 
3n,„Hlg(z.) 



(24) 



(25) 



Here T(k) is the transfer function and g(z) oc (1 + z)D(z) is the 
growth function of the potential. We can then write for the matter 
power spectrum P 



P(k, z) = a\k, z) P«,(k, z.) - aHk, z) P^{k, z,) ; 



(26) 



we consider linear theory only, so we assume P = Pij„. 

The halo mass function is modified in the prese nce of PNG as 
it gain s a dependence on the skewness. We use the ILo Verde et al.l 
( l2008l) mass function (LV), which was obtained by using the Edge- 
worth expansion, and is given by 



o 



I dS-i cr 

dmcr o 



(27) 

where S3 is the skewne ss of the matter density field defined as in 
IPesiacgues et al.l ^20091) (the mass dependence is implicit). To im- 
prove the agreement with A'-body simulations, and for consistency 
with the rest of our analysis, we replace its Gaussian limit from the 
PS to the Tinker form, so that we use the rescaled form defined as: 



/lv(v) : 



My) 



/lv(v) . 



(28) 



/ps(v) ■ 

where fps = ^J2/n v exp is the PS mass function. 

We apply the peak-background split formalism and analyt- 
ically derive the Lagrangian linear halo bias associated to the 
Lo Verde mass function using Eq. dlSt as 



cr- 



1 6 + SM4v' -4v) + 2^<rv 
0-6V + SMy* - - 1) + £^cr(v'- - 1) 



(29) 



while the Lagrangian bias associated to the rescaled mass function 
ofEq. (HHlis 



bty(y) ■ 



ft^vM + ixM-ftpsM, 



(30) 



which is the bias we use in the following. 

In the presence of PNG, the halo density perturbations depend 
not only on the dark matter perturbations 6, but also on the potential 
(p. The latter can then be related back to the density in Fourier space 
by using the Poisson equation, so that the effective Eulerian bias 
can be written at the mean redshift z - 0.2 as 



fteff (M, k, /nl) = b(M, /nl) + Ab{M, k, /nl) , 



(31) 



where the bias contains implicitly a scale-independent correction 
Shifts) = b{M, /nl) ~ b{M, 0) with respect to the Gaussian case, 
following from the difference in the mass function, and the scale- 
dependent part is 



2/nl<5,Z7^(M,/nl) 



(32) 



aikrz) 

As in the Gaussian case, we average the bias over the masses 




k [hMpc" 



Figure 5. The effect of PNG on the cluster power spectrum. We compare 
the data (black points) with the predictions for the best-fit model to our 
data with /nl = -46 (red solid) and for two cases with /nl = -200 (blue 
dashed) and /nl = 200 ( green dot-dashed). The dotted Hne at ^max — 

0.15/! 

Mpc"' represents the smallest scale we use in the analysis. 



in our catalogue following Eq. ( I17t . In order to take into account 
the uncertainty on our assumption of a mass function, we also in- 
troduce a nuisance parameter B as in Section 13.41 which rescales 
the bias as b"^" = b ■ B. 

The scale-independent correction (5/?(/nl) is small, easily con- 
fused with other normalisation effects, and relies on the assumed 
form of the mass function and the peak-background split method. 
For these reasons, it is worth ensuring that the results do not depend 
on this contribution. We make sure this happens in our case because 
any constant rescaling of the bias can be equally explained by ei- 
ther a change in the nuisance parameter B or a change in /nl. But 
since a model with /nl also predicts the scale-dependent bias, 
it will be favoured only in case such a feature is indeed observed 
in the data, otherwise the S ^ 1 model will be assigned a better 
likelihood. In practice, we impose some Gaussian priors centred on 
S = 1, but we have checked that the results on /nl do not depend 
significantly on this choice. 

We show in Fig.[5]the full power spectrum P°''^{k) in the pres- 
ence of PNG for a choice of /nl values, compared with the data. 
The scale-dependent bias induced by PNG is visible on large scales 
(small k), while the smaller scale-independent contribution can be 
seen on small scales (large k). Note that the survey window con- 
volution of Eq. l |19l l partially suppresses the effect of PNG on the 
largest scales, which become comparable with the survey volume. 



4 LIKELIHOOD ANALYSIS AND RESULTS 

We use cluster counts, WL masses and the cluster power spectrum 
to fit the richness-mass relation and constrain cosmology simulta- 
neously. In particular, our observables are: 

(1) Cluster counts AA', divided into 10 richness bins; 
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Type 


Symbol 


Definition 


Prior without CMB 


Prior with CMB 


n Q m nl no \7 


h 




0.7 


[0.4, 0.9] 






ScclIlIT spectrsl index 


96 


[0 5 15] 






Rqitoti pnprfT^/ Hpncit^? 


04397 


|_w.O 1 , 






Cold dark matter energy density 


[0.1,0.9] 


[0.1,0.9] 




lo2(10'*^A.) 


Amplitude of piimordial perturbations 


[0.1,6.0] 


[0.1,6.0] 




■J- 


Optical depth 


0.09 


[0.01, 0.125] 




,/nl 


Primordial non-Gaussianity amplitude 


[-900, 900] 


[-900, 900] 


Scaling relation 


\x\N\ = InA^gailMi 


Richness at Mi = 1.3 ■ IO^Mq 


[1.0,4.0] 


[1.0,4.0] 




\nN2 = In //gal 1^2 


Richness at M2 = 1.3 ■ IO'^Mq 


[1.0,4.0] 


[1.0,4.0] 






Scatter 


0.45 ±0.1 


0.45 ±0.1 


Nuisance 


B 


Weak lensing mass measurements bias 


1.0 ± 0.06 


1.0 ± 0.06 




B 


Scatter on bias derived from mass function 


1.0 ±0.15 


1.0 ±0.15 






Non-Hnear correction to power spectrum 


[0.0, 50.0] 


[0.0, 50.0] 




0"; 


Photo-Z errors 


[0, 120] 


[0, 120] 




Asz 


Amplitude of CMB SZ template 


1 


[0,2] 


Derived 


n,„ 


Total matter energy density 








0"8 


Amplitude of density perturbations 







Table 1. Parameters used in the analysis and their assumed piiors. In the prior coluinns a single nuinber 11 stands for a fixed value, [a, b] stands for a flat prior, 
fi±(T means a Gaussian prior of mean fi and standard deviation a. 



(2) Total mass of clusters ANM, divided into 6 richness bins; 

(3) Cluster power spectrum P"^", divided into 18 k bins. 

The covariance matrix we use for the cosmological analysis is 
composed by the parts discussed in Section |2] In addition to the 
cluster data we als o use the CIVIB power spectra from WIV1AP7 
jLarson et al.ll201 iT) . in the cases specified below. 

We assume a flat ACDM cosmological model. When using 
cluster data alone we fix the Hubble parameter h = 0.7, primordial 
spectral index its = 0.96 and baryon density Qi, = 0.044, as these 
parameters are not easily constrained in this case; we relax these 
assumptions when adding external CIVIB data. Note that we need to 
fix the spectral index of scalar density perturbations because of the 
small range in scale which our mass range corresponds to. 

We then perform Bayesian parameter estimation by run- 
ning IVIonte Carlo IVIarkov Chains (MCMCs), using Metropo- 
lis sampling with a modified version of the COSMOMC code 
jLewis & Bridlell2002h . In Table[T]we list all the parameters of the 
analysis, including their assumed priors. We estimate the posterior 
probability distributions in the following cases: 

(i) Counts only: 6 free parameters [Q.^ log(10"'A.,), InA'i, 
lnA'2, o"inM|A'„,,pyS]5 without the cluster power spectrum; 

(ii) Counts with /nl: V free parameters [Q.^, log(10"'Aj), InA'i, 
In A'2, a-]nMiN„.,, , yS. ,/nl]> without the cluster power spectrum; 

(iii) Counts+PC*:): 9 free parameters [Q^, logClO'^A,,), InA'i, 
lnA'2, CTinMiA",,,!./^' ?NL. CTz' ^1' ^ith the cluster power spectrum; 

(iv) Counts+P(A) with /nl: 10 free parameters [Q.^, 
log(10'°A,), InAi, lnA'2, cri„M|w,,,, /3, ?nl, cr-, B, /nl], with 
the cluster power spectrum; 

(v) CIMB only: 7 free parameters h, t, n^, A,-, Sl^, 
logClO'^A,)], with CIVIB data only; 

(vi) CMB+clusters: 14 free parameters [Sli,, h, t, n,, A,-, Q.^, 
logClO'^A,), InAi, lnA2, cr,„„|A,_^_,,, /3, ^nl. ct-, S], with CMB and 
all cluster data; 

(vii) ClMB+clusters with /nl: 15 free parameters [Q.i„ h, r, n„ 
A,,, n„ logClO'^A,), In Al, lnA'2, (Ti^MiNg^, 9nl, tr-, B, /nl], with 
CMB and all cluster data. 



4.1 Results 

We summarise our results in Table |2] and we show in Figs. |6] [7] 
[8]and|9]the 2D 68% and 95% marginalised confidence regions for 
different pairs of parameters in our analysis. The colour scheme is 
the same for all figures: blue contours refer to runs with counts and 
WL mean masses data only, green contours include in addition the 
cluster power spectrum data, while orange contours also include 
CMB data from WMAP7. 

The joint constraint in the CI,,, - erg plane in Fig.[6]displays the 
typical degeneracy from cluster counts: the counts increase with 
increasing Q,,, and erg values, hence any increase in SI,,, must be 
balanced by a decrease in erg (and viceversa), to keep the abun- 
dances at the observed values. The constraints on indivi dual param- 
eters w ith counts and masses only are consistent with IRozo et al.l 
( l2010h . and we find fi„, = 0.25 ± 0.06, erg = 0.80 ± 0.06 (Icr er- 
rors throughout), while the errors are improved by a factor between 
1.5 and 3, depending on the parameter, when adding the maxBCG 
power spectrum: in this case we obtain Cl„, = 0.215 ± 0.022, erg = 
0.84 ± 0.04. Combining then these results with the CMB data, the 
constraints shrink to CI,,, = 0.255 ± 0.014 and erg = 0.790 ± 0.016: 
the contribution of the CMB tightens the en'ors by a further factor 
of two. 

In Fig.Qwe show the marginalised posterior probability con- 
tours of the scaling relation parameters InAi and lnA2. Con- 
straints on individual p arameters us i ng cou nts and masses only are 
again compatible with IRozo et al.l ( 1201 Oh (InA'i = 2.44 + 0.11, 
lnA'2 = 4.16 ± 0.15), while errors are reduced when adding the 
power spectrum, even if less significantly (In Ni = 2.49 ± 0.09, 
In A2 = 4.13 ± 0.13). Combining these results with the CMB data, 
the constraints are almost identical, as the CMB is not sensitive to 
the clusters scaling relation. Our c onstraints on the sc aling relation 
scatter (Ti„^^ are in agreement with lRozo et al. l (l2O10h . and they are 
not improved by the addition of power spectrum and CMB data. 

Figs.[8]|9]show the constraints on /nl and its degeneracies with 
Q.„, and erg. First we can see that, when only counts and masses are 
used, the constraints on /nl are weak as expected. The situation 
improves when adding the cluster power spectrum: in this case, 
the constraints are tighter, and we observe a positive correlation 
between /nl and fi,,, and an anti-correlation with erg. To under- 
stand this behaviour, we notice that if we increase (decrease) Q„,, 
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Figure 6. Marginalised posterior probability distributions on the parameters 
^2,„ - o"8 for the runs using Counts only (blue), Counts+P(i) (green) and 
Counts+P(A)+CMB (orange), at 68% and 95% confidence levels. 
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Figure 8. Marginalised posterior probability distributions on the parame- 
ters /nl ~ fi/n for the runs using Counts with /nl (blue), Counts+P(fc) 
with /nl (green) and CMB+clusters with /nl (orange), at 68% and 95% 
confidence levels. 



z 

s 



4.7 
4.6 
4.5 
4.4 
4.3 
4.2 
4.1 
4 
3.9 
3.8 
3.7 



Counts+P(k)+CMB 



Counts only 




Counts+P(k) 



2.1 



2.2 



2.3 2.4 
In N 



2.5 



2.6 



2.7 



Figure 7. Constraints on the scaling relation parameters for the runs using 
Counts only (blue), Counts+P(A:) (green) and Counts+P(A:)+CMB (or- 
ange), at 68% and 95% confidence levels. Notice that InWi = InWgailMi 
and In 7^2 = In A'g„||M2, where Mi = 1.3 ■ 10'-*Mo and M2 = 1.3 ■ IO'^Mq. 
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Figure 9. Marginalised posterior probability distributions on the /nl - o"8 
plane for the runs including Counts with /nl (blue), Counts-HP(ft) with 
/nl (green) and CMB-Hclusters with /nl (orange), at 68% and 95% confi- 
dence levels. 



the peak of the power spectrum decreases (increases) while also be- 
ing shifted to higher (lower) values of k, while erg simply changes 
the overall normalisation. As described above, an increase in /nl 
causes a boost in the power spectrum on large scales (small k), so 
that CTg needs to decrease to compensate a higher /nl: this is exactly 
what is shown in Fig. [9] In addition to this Q.,„ should increase to 
compensate a higher /nl: this can be seen in Fig. [8] We also see 
that the addition of the CMB power spectrum data improves the 
constraints on Q,„ and erg and only indirectly reduces the bounds 



on /nl, since PNG simply affects the higher-order statistics of the 
CMB. 

Our constraints on PNG are /nl = 12±157(lcr) (without 
CMB) and /nl = 194 ± 128 (with CMB), which are statistically 
compatible with zero and with each other. The shift in the mean be- 
tween the two results is clear by looking at Figs.[8]|9] the addition 
of the CMB favours lower values of erg (and higher values of Si,,,), 
thus shifting the favoured /nl values in the process. While not com- 
petitive with results from the CMB bispectrum or from combined 
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Params Counts only Counts+P(ft) Clusters+CMB 





no/NL 






+/nl 




+/nl 




0.25 ± 0.06 


0.25 ± 0.06 


0.215 ±0.022 


0.209 ± 0.022 


0.255 ± 0.014 


0.248 ±0.013 


0"8 


0.80 ± 0.06 


0.77 ± 0.07 


0.84 ± 0.04 


0.85 ± 0.05 


0.790 ±0.016 


0.780 ±0.016 


InWi 


2.44 ±0.11 


2.44 ±0.11 


2.49 ± 0.09 


2.49 ± 0.08 


2.44 ± 0.08 


2.43 ± 0.08 


\nN2 


4.16 ±0.15 


4.15 ±0.15 


4.13 ±0.13 


4.11 ±0.12 


4.19±0.11 


4.15 ±0.11 


0"lnM 


0.38 ± 0.06 


0.38 ± 0.06 


0.36 ±0.06 


0.37 ± 0.06 


0.378 ± 0.059 


0.38 ± 0.06 


/? 


1.00 ±0.06 


1.01 ±0.06 


1.01 ±0.06 


1.01 ±0.06 


1.01 ±0.06 


1.00 ±0.06 


<?NL 






26± 10 


27 ± 10 


14 ±6 


16±7 


(T- 






46± 12 


42 ±8 


43 ± 10 


31 ±5 


B 






1.07 ±0.13 


1.01 ±0.15 


1.19±0.11 


1.00 ±0.14 


/nl 




282 ±317 




12 ± 157 




194 ± 128 



Table 2. Marginalised mean values and Icr errors on the cosmological parameters, for the runs Counts only. Counts with /nl, Counts+P(A:), Counts+P(fc) 
with /nl, CMB+clusters and CMB+clusters with /nl. Note that £1,,, and erg are derived parameters in our analysis. 



analyses of multiple galaxy surveys, it is interesting to find such 
constraints independently and for the first time with the clustering 
of galaxy clusters. 

Since we restrict our analysis to nearly-linear scales, by im- 
posing the data cut at fc„,„ = 0.15 A Mpc"', we are not expect- 
ing strong constraints on (/nl- The constraints we found are i ndeed 
broad and in agreement within the errors with the results by iHutsil 
who found ^nl = 14.2 ±2.8 when marginalising over three 
parameters only: we obtain (/nl = 26 ± 10 and (/nl = 14 ± 6 when 
also using CMB data. 

It is also worth mentioning the results on the B parameter, 
which was introduced to take into account the uncertainty in the 
bias expression derived from the mass function. As this parameter 
allows an arbitrary constant rescaling of the bias, it also has the 
desirable property of cancelling the effect of the scale-independent 
bias correction 6b(fm), as described in Section [33] To check that 
the Gaussian prior we are imposing B = 1.0 ± 0.15 is large enough 
for both purposes, we made an additional run replacing it with a 
flat prior B e [0.0001, 5]. In this way, we obtain nearly unchanged 
results on /nl- 



number counts and masses only jRozo et ahlboiOl) , obtaining e.g. 
for the fluctuation amplitude erg = 0.84 ± 0.04 (Icr) and for the 
matter content Q.,„ = 0.215 ± 0.022 (Icr). These are further tight- 
ened by a factor of 2 by the addition of the CMB data. On the other 
hand, we found that the errors on the scaling relation parameters 
are consistent with previous works, but not significantly improved 
by the addition of the cluster power spectrum. 

As an interesting application, we also tested primordial non- 
Gaussianity, which is constrained through the non-Gaussian halo 
mass function and the scale-dependent cluster bias. Assuming de- 
viations from Gaussianity at the three-point (skewness) level of 
the local type, we obtained /nl = 12 ± 157 (Icr) from our com- 
bined data set, which shifts to /nl = 194 + 128 (Icr) when includ- 
ing the WMAP7 CMB data. While not competitive with the CMB 
bispectrum and with results from combined galaxy clustering data 
sets, this result is consistent with them and was obtained using the 
maxBCG cluster data alone. Our results can be seen as a proof of 
concept towards a full joint analysis of the LSS, consistently in- 
cluding both galaxies and clusters as dark matter tracers, to achieve 
the full potential of the upcoming galaxy surveys such as the Dark 
Energy Survey and the Euclid mission. 



5 CONCLUSIONS 

In this work we have investigated the cosmological implications 
of the optically-selected SDSS maxBCG galaxy cluster data, ob- 
taining extended cosmological constraints with respect to previ- 
ous works. We considered the number counts of clusters in rich- 
ness bins and the weak lensing mass estimations, including the 
respective covariances, for a cross-calibration of the scaling rela- 
tion. We then combined such data for the first time with a mea- 
surement of the redshift-space power spectrum of the same clus- 
ters. In the modelling we included an effective treatment of the 
non-linear contribution, photo-z smoothing, redshift space distor- 
tions and Alcock-Paczynski effect. We only considered quasi-linear 
scales atk < ky„.,^ = 0.15 h Mpc"' to be conservative. We estimated 
the off-diagonal terms of the counts-clustering covariance matrix 
with a jackknife method applied on both data and A'-body simula- 
tions, and found consistently that such contributions are negligible. 

We then performed a full MCMC analysis of the posterior 
probability distribution of cosmological parameters given the full 
data set. By thus combining the one- and two-point statistics, we 
achieved a factor 1.5 to 3 improvement on the errors on the cos- 
mological parameters, if compared with previous analyses using 
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